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Abstract. - We analytically explore the scaling properties of a general class of nested subgraphs 
in complex networks, which includes the K-core and the A-scaffold, among others. We name such 
class of subgraphs A-nested subgraphs due to the fact that they generate families of subgraphs 
such that ...Sk+i(G) C Sk{Q) C Sk-i{G)---- Using the so-called configuration model it is shown 
that any family of nested subgraphs over a network with diverging second moment and finite first 
moment has infinite elements (i.e. lacking a percolation threshold). Moreover, for a scale-free 
network with the above properties, we show that any nested family of subgraphs is self-similar by 
looking at the degree distribution. Both numerical simulations and real data are analyzed and 
display good agreement with our theoretical predictions. 
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Introduction. — The internal organization of most 
complex systems displays some sort of nestedness asso- 
ciated to some type of hierarchical organization. Such 
patterns can be detected by using appropriate theoretical 
tools which help us understanding the system's structure 
in terms of a network [1-7]. Furthermore, the structure 
of such communities can provide us valuable information 
about invariant properties and potential univcrsals. In 
this work we will define a general class of network sub- 
structure which we called A'— nested subgraph. Such class 
df subgraphs includes the K-core, the A-scaffold or the 
random deletion of nodes. But it also includes any other 
substructure you can define, if it holds a small set of prob- 
abilistic restrictions. We develop a general, unified frame- 
work that enables us to study generic properties of such 
K— nested subgraphs. As we should see, the most common 
class of real networks, those with connectivity patterns fol- 
lowing a power-law distribution P(k) oc k~ a , 2 > a > 3, 
have very interesting properties when looking to subgraph 
nestedness. In this context, theoretical studies on the re- 
silience of both A-cores [4, 8] and A-scaffolds [6, 9] sug- 
gest that arbitrary large scale-free networks contain infi- 
nite, asymptotically self-similar, A-cores and A'-scaffolds, 
indicating that such subgraphs are highly robust against 
random deletion of nodes. Metaphorically, it has been 
suggested that the structure of complex nets is similar to 



a Russian doll [4]. 

These results are consistent with the mounting evidence 
indicating that scale-free networks exhibit general self- 
similar properties [4,10-13]. From the physical point of 
view, the assymptotical invariance of the degree distribu- 
tion of scale-free nets under nesting operations is one of 
their most salient properties. At the theoretical level, the 
conservation of P(k) the degree distribution implies self- 
similarity, as far as most of the properties of a random 
graph are determined by its degree distribution [14]. Of 
course, real nets are not exactly random graphs, but such 
approach revealed surprisingly adequate to study real sys- 
tems [15]. Furthermore, self-similar properties and scaling 
laws might be an indication that such objects are orga- 
nized near criticality [16, 17]. 

In this letter wc generalize previous approaches, showing 
that any nested family of subgraphs of a given scale free 
network has an infinite percolation threshold i.e., there is 
an infinite set of Russian dolls for such networks. More- 
over, it can be shown that such families are self-similar. 
We develop such concepts under the framework of the so- 
called configuration model [18], which works on an ensem- 
ble of arbitrarily large, sparse and uncorrelated graphs 
with specific properties. The remaining of the paper is 
organized as follows: First, we formally define the con- 
cept of K-nested subgraph and we show how the above 
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Fig. 1: Some subgraphs samples that enable us to define a 
nested family of subgraphs. In the original graph (left) we 
shadowed the nodes that disappear under the operation of Sk- 
In the right-hand side, we display the giant component of the 
obtained graph, Sk- We find the isT-scaffold, (K = 3) (a). 
The if-scaffold is the subgraph obtained by choosing all the 
nodes whose connectivity is equal or higher than K and all the 
nodes connected to them. Such a subgraph enables to study the 
fundamental hub-connector structure of the complex networks, 
(b) The K-core (K — 3), the largest induced subgraph whose 
minimal connectivity is equal to K. (c) A subgraph obtained 
by randomly deleting a fraction (v = 5/21) of nodes (commonly 
referred by the literature as random failures.) 

mentioned examples hold the required conditions. Then, 
we derive the general percolation properties and the final, 
generic form, of an arbitrary nested subgraph of a given 
net. From the developed formalism, we apply our results 
to specific network topologies. 

Nested Subgraphs. — Formally, a complex network 
is topologically described by a graph Q(V,T) where V is 
the set of nodes and r : V — > V the set of edges connecting 
nodes of V. If P(k) is the probability that a randomly 
chosen node is connected to k other nodes, then 

oo oo 

(k)=J2kP(k) <fc 2 H£fc 2 P(fc) 

k k 

is the average connectivity of Q and the second moment 
of the distribution, respectively. 

We will say that S(A, Ta) is an induced subgraph of 
G(V,T) if A C V and T A C T, being T A a mapping 
: A — > A. We can define many subgraphs from a 
given graph. Here we are interested in a special set of 
subgraphs, hereafter K -nested subgraphs, which includes, 
as special cases, the family of successive K-coies or K- 
scaffolds and the so called v— deletion graph, obtained by 




Fig. 2: A complex network with broad distribution of links (a) 
and two nested subgraphs: (b) Its _R"-core (K = 4) and (c) the 
corresponding iC-scaffold (K = 20) 

deleting a fraction v of nodes. A if-nested family of sub- 
graphs M is a collection of subgraphs of a given graph Q, 

N = {Si(g),s 2 (g),...,Si(g),...} such that: 

...Sk+i{Q) QS k (G) CSk-i(S)... (i) 

For every family of .K'-nested subgraphs we associate a 
nesting function tpx(k), namely the probability for a ran- 
domly chosen node with degree k to belong to Sk- If 
U C R is a set that depends on the nature of the nesting, 
fK{k) is such that: 

ip K {k) : U x [0,1] (2) 

It is easy to see that, for a function to be a nesting func- 
tion, it has to fulfill the following logical conditions: 

{cp K (k') > y K {k)) (fc' > k) (3) 

(<PK>(k)><p K (k))=>{K' <K) (4) 
(V^)(3A Sk g (0, 1])|( lim VK {h)) = X Sk ) (5) 

k — >oc 

where Xs K is a scalar whose value will depend on the ex- 
plicit form of Sk- In short, tpK^k) is a non-decreasing 
function on k (eq. (3)) and a non increasing function on 
K (eq. (4)). Note that such a function implies that all 
the nodes satisfying the conditions are taken into account: 
Our subgraphs are maximal under the conditions imposed 
by ifK- Furthermore, note that, for a fixed K, (fK{k) has 
an horizontal asymptote at ipK(k) = Xs K (eq. (5)). Thus: 

]im((p K (k + l)-<p K (k))=0 (6) 

k — >oc 

From (3, 4, 5, 6) we can see that, for a fixed K, and 
< 5 < 1 there exist a k* such that: 

(Vk^k, > k*) (\\cp K (ki) - f K (k 3 )\\ < S) (7) 

and we can conclude that the sequence {<fK(k)} = 
if k (1) , ifK (2), (pK(i), ■■■ is a Cauchy sequence. As we 
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should see, this property will be useful in the following 
sections. Let us now explore some relevant nesting func- 
tions. 

a) K-core subgraphs. The if-corc is the largest induced 
subgraph whose minimal connectivity is K . Intuitively, it 
is clear that a collection of if-cores from a given graph Q 
defines a nested family of subgraphs. Within the config- 
uration model, we can informally identify the probability 
for a given node of Q to belong to the giant .fT-corc with 
the probability to belong to an infinite [K — l)-ary sub- 
tree of Q [4,8, 19]. Therefore, the probability for a given 
node to belong to the K-cove equals to the probability of 
belonging to an infinite (K — l)-ary subtree. Let R be the 
probability that a given end of an edge is not the root of 
an infinite [K — l)-ary subtree. The associated nesting 
function for the iC-core is (fx(k) = 0, if k < K and 

= £ ( k )R k -\l-R) 1 (8) 

i=K W 

otherwise. It is straightforward to check that such a func- 
tion follows (3, 4, 5). 

h)K -scaffold subgraphs The if-scaffold of a given graph 
is the subgraph obtained by choosing all the nodes whose 
k > K and the nodes that, despite their connectivity is 
k < K, they are connected to a node e' whose k' > K 
[6,9]. The nesting function for the if -scaffold is <pic{k) = 
1, if k > K and 



<PK(k) = 1 - 




otherwise. Note that, for both the A'-nested families of 
.FT-scaffolds and if-cores, Xs K = 1. A variety of sub- 
graphs can be defined from the if-scaffold, such as the 
naked if-scaffold (a subgraph obtained by cutting all the 
nodes whose degree is k = 1 in the if-scaffold). 

c) Random deletion of nodes.- Suppose we delete a frac- 
tion v = 1 — v of nodes from our graph. Such an operation 
can be also formalized in terms of nesting functions. For 
the sake of simplicity, if we are performing a random dele- 
tion of a fraction of nodes from Q, we will indicate the 
nesting function and the subgraphs as (p v and S v , respec- 
tively. The associated nesting function is, simply: 

(Vfc)(^(fc) = v) (10) 

For mathematical purposes, let us introduce an addi- 
tional class of subgraphs, Sx-y, of a given subgraph Sk- 
The main feature of such subgraphs is that Sk*y Q Sk- We 
name such subgraphs minor subgraphs of Sk- To charac- 
terize such subgraphs, we say that jK(k) is a minor nesting 
function of <fiK(k) if (7if(fc) < /rt(A;)) for all k. Given an 
arbitrary ipx{k), we can build a minor nesting function as 
follows: Let k' be the minimum k such that (pxik') ^= (it 
could be k 1 = 1). Then find an e > such that e < tpK(k')- 
Thus, 

f Qifk<k' 
T*( fc ) = ( eifk>k> ( U ) 



This trivial way to define a minor subgraph from a given 
subgraph Sk is enough, since both jK(k) and tpK{k) ver- 
ifie (3,4,5). Moreover, it is clear that 1 (Sr- 7 C Sk) for all 
K. 

Percolation of nested subgraphs. Previous to 
determining the specific statistical properties of the ob- 
tained subgraphs, we are interested in knowing whether 
there is a giant component in Sk, i.e., if the operation of 
nesting breaks (or not) the initial graph Q into many small 
components. We consider first the general problem. 

Let us define the generating functions for an arbitrary 
A"-ncsted subgraph with an associated nesting function 
ifK{k) defined on Q with arbitrary (but smooth) degree 
distribution P(k). 

oo 

F (z)=J2P(k)^K(k)z k (12) 

k 

1 oo 

Fx(z) = -^J2 kP ( k ^ k > k ' 1 ( 13 ) 

The averages -i.e., the values at z = 1 of eqs. (5) and (6)- 
are, respectively, /i = Fq(1) and oj = -Fi(l). Here, fi is the 
fraction of nodes from Q that belong to Sk ■ Similarly, oj is 
the relation among (k) and the average number of nodes 
from V reachable after computing the nested subgraph. 
The generating function for the size of components -other 
than the giant component- which can be reached from a 
randomly chosen node is: 

ffi(z) = 1 - w + zFi(Hi(z)) (14) 

and the generating function for the size of the component 
to which a randomly chosen node belongs to is [15,20]: 

H (z) = l- t i + zF {H 1 (z)) (15) 

thus, the average component size other than the giant 
component is: 

( S )=^(1)= M + ^(1K(1) (16) 

If we compute the derivative, it is straightforward to see 
that it leads to a singularity when F[(l) = 1. Thus, if 
F{(1) = t|t J2k k{k — l)fK{k)P(k), to ensure the presence 
of a giant Sk, the following inequality has to hold: 

k{k - 2)P(k) >Y J k ( k - l)(pK{k)P{k) (17) 

k k 

Where 0k (k) = 1 — ipK(k). This can be seen as the nat- 
ural extension of the Molloy and Reed criterion [21] for 

1 ~Let us suppose a graph Q and two subgraphs of it, S,s,S v i, ob- 
tained by deleting at random v = 1 — v and v' = l — u', with v' > v. 
Clearly, we cannot conclude that S v is an induced subgraph of S v i . 
But it is true that the properties of S v will be, with high proba- 
bility, the properties of some induced subgraph of S v i obtained by 
deleting at random v nodes of Q. Furthermore, it can be shown that 
the probability to find a diverging value decays exponentially with 
the size of the system -Recall that we are working with an ensemble 
formalism. 
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Fig. 3: The simplest family of nested subgraphs, obtained 
by removing all nodes whose connectivity is less than K: 
ip K (k) = @(K,k), where 6(K,k) = 1 iff k > K and oth- 
erwise, (a) Numerical computation of the size of the giant 
component = 1 — Ho(l) = Li — Fo(u) where it is the first, 
non trivial solution of u = 1 — lj + Fi(u), for ipii(k) = &(K, k). 
This curve corresponds to a scale-free network with a ~ 2.15. 
No specific scale is identified. The sharp decay for the large 
K values can be attributed to the finite size of the system (In 
this simulation, we assumed k max ~ 5000 ). (b) The same 
computation over an Erdos Renyi graph with (k) — 30 dis- 
plays a clear characteristic scale where the giant component is 
completely eliminated. 



We can rewrite Ps K as: 



^ i>k 



But note that, due to relation (7), for large fc's: 
\Lo k (l-u;y- k P(i)^5(Lo,\ SK ) 



-W \ ^ 

p i>k 



(21) 
(22) 



Thus Ps K is reduced to: 



h 1 



>k 



u k {l-Lo) l - k P{i) (23) 



Let us rewrite equation (23) in order to extract analyti- 
cal results. If the first generating function of the degree 
distribution of Q, without taking into account the nesting 
operation, is: 



(24) 



any nested subgraph Sk, with associated nesting function 
tpx^k). A more compact expression of such a criterion is: 



k 2 ip K {k)P(k) - (1 + Lu){k) > 



(18) 



Degree distribution of Sk- — The next step is to 
compute the degree distribution of the nested subgraphs, 
Ps K (k) . The key question is finding the average num- 
ber of nodes a given node will reach, if it survived to the 
computation of Sk- Taking into account the set of all 
nodes of Q, the average connectivity will decrease a factor 
to = Fi(l) = l/(k) x T,k kl PK(k)P(k). Clearly, the prob- 
ability for a surviving node whith connectivity k in Q to 
display connectivity k' < k in Sk, P(& — ^ k'), is: 



P(fc -► k') = 



uj k '(l -w) 



k-k' 



(19) 



And, in absence of correlations, a node whith connectivity 
k in Q now will survive with a probability ipK(k) and it 
will be connected, on average, to mk nodes. If we take into 
account all the possible contributions of the nodes of Q to 
the abundance of nodes with certain degree k in Sk, we 
have: 



1 / \ 

p SK (k) = . <» k (i - ^y k p( 



i>k 



(20) 



Where P$ K (k) is the probability to find a node of degree 
k after the computation of Sk- Note that the factor — 
normalizes Ps K (k). Clearly, if we define 5(w, \s K ) as: 



i>k 



It is straightforward that: 



dx 1 



co(*) = E(^« 



i—k 



i>k 



(25) 



Thus, we can rewrite the degree distribution (23) in terms 
of the derivatives of Gq(z): 



PsAk) 



X 



Sk ^ 



[l k\ dz k 



G (z) 



(26) 



In the following, we will apply our results to standard 
topologies of network theory: The Erdos Renyi graphs 
and the Power-law graphs. 



Erdos Renyi Graphs. 

graph, 

P(k) 



In the Erdos Renyi (E-R) 



{k) k e 



k p (k) 



Id 



(27) 



and (k 2 ) = (k) 2 . To study specifical percolation pre- 
operties, we need to know the specific shape of tpK(k). 
In (fig.3-b)) we approached numerically the size of the 
giant component in an E-R graph where a nesting suc- 
cessive nesting operation is performed. A clear thresh- 
old is observed, displaying a critical point where the gi- 
ant connected component is completely eliminated. The 
special case of tpK{k) = v recovers the well-known per- 
colation condition for E-R graphs under random damage, 
(k) > (1 + v)/v. The predictions for the degree distribu- 
tion are more general and accurate. Indeed, the expression 
for G (z) in E-R graphs is G% R (z) = e^^- 1 ). Thus, if, 
as we defined above, fx = F (l) : 



Ps E K R (k) 



X Sk {tok) k e^ 



k\ 



(28) 



T»_/1 
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Fig. 4: Analyzing the web obtained from the O. Wilde's novel 
The portrait of Dorian Gray. The network was built up by 
tracing an arc between two adjacent words, if they appear 
one after the other within the same sentence. The obtained 
graph has N = 5696 nodes and displays a scale-free distribu- 
tion P(k) oc k~ a (grey circles), with an exponential cut-off 
at high connectivities [k > fOOO). In this graph, a « 2.15 
and (k) w 8.814 . We plot the cumulative frequency for the 
A-cores, 4 > A > 11 (left). Despite the strong connectivity 
requeriments imposed for the A-core, the distribution behaves 
as an statistical invariant. The same is observed with succes- 
sive naked A-scaffold subgraphs, A = 14, 16, 18, 20, 22, 30, 40 
(right). The naked A'— scaffold subgraph is obtained from the 
A-scaffold but deleting all the nodes with k < A that are 
connected only to one node with k' > K. 



This implies that, for large k's, the nesting operation over 
an E-R graph results in an E-R graph but with a factor u 
correcting the mean value, whose value goes from (k) — > 

Uj(k). 

Scale-free nets. — Let us assume a scale-free network 
with 

P(k) oc k~ a (29) 

with scaling exponent 2 < a < 3. We will show that, 
at the thermodynamic limit, any family of subgraphs has 
infinite subgraphs. This has been shown separately for 
the AT-core [4,8] and the AT-scaffold [6]. One of the main 
characteristics of such nets is that (k 2 ) — > oo, and that (k) 
does not diverge with network size. 

What we should prove is that, under these conditions, 
relation (18) holds for all K's. In other words, there is 
no characteristic scale for the substructure generated by 
ipxik). Indeed, our subgraphs need to fulfill the inequal- 
ity: 

\k 2 ip K {k)P{k) - (1 +cu)(k) > (30) 



But we cannot work directly with an arbitrary nesting 
function ipx- Thus, to prove the above claim, we build a 
minor nesting function 7k"(&) of our ipn{k) 1 as defined in 
(11), assuming k' as the smallest k such that (fic(k) > 0. 
Thus, if lo 1 = F^(l) has the form: 



= e 1- 



V kP(k)\ _ 
^ (k) J ~ 



k<k 



(31) 



The corresponding percolation condition for Sk^ is, thus: 
e k2p ( k ) - (l + OW > 

k>k' 

But since (k 2 ) diverges, we will have tJ2k>k' k 2 P(k) — > oo 
and condition (18) always holds, provided that (k) is finite. 
This implies that percolation of any nested subgraph of 
an arbitrary large scale-free network is guaranteed, as far 
as 5fc Q Sk- Numerical simulations (see (fig3-a)) of 
the size of the giant component display no critical scale 
for the emergence (elimination) of the Giant connected 
component. 

The above mathematical machinery will lead us to 
demonstrate that our families of nested subgraphs exhibit 
invariance in degree distribution. If we put the distribu- 



tion P(k) 
to: 



C~ l k- 



(C = C( a ))j equation (26) becomes 



Ps K (k)n\ SK 



G S o F (z) 



k\ dz 



(32) 



Thus the problem lies on finding the k-th derivative of 
Gq F (z). The computation is slightly more complex than 
the E-R graphs, and involves some approaches. First, 
we compute the generating function for an scale-free net 
P(k) = C _1 /c _Q whose exponent lies between 2 and 3, 

<#*(*): 



G% F (z)=C- 1 U a (z) 



dt- 



T(a) 

Where Li a (z) = f« ^ s * ne polylogarithm function 
and, to obtain the last step, we used its integral form. But, 
actually, we are interested in the derivatives of Gq F (z). If 
we assume z — > 
approached by: 



1 the k — th derivative of Gq(z) can be 



W) 
, kl 



dt- 



= C 



_iAr!r' 



T(a) 



dt- 



- z) k + l 
t a-l 



T(a) 



- r 

,a-l 



k+l 



dy 



Where 
1, 



z 



in the first approach, we used the fact that, if 
we are near a singularity when t — > 0. Thus, the 
dominant terms of the sum will be those close to t = 0. 
This enables us to rewrite e* « 1 + t + 0(t 2 ). In the last 
step, we made the coordinate change r = 1 — z and, then, 
t = yr. If we evaluate such an expression at z = 1 — w, 
with lu small enough: 



jk 



-l-k 



L — ~ ; Jfe+l,a+l 



T(a) 

Where Jfe+i, a +i is defined as: 

■ a ~ 1 r(a)r(fc-a + l) 



l,a+l 



dy 



o tf (y + l) fe+1 k\(k-a + 2) 
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If we check the behavior of Jk+i,a+i for large k's, we see 
that: 

Jfc+i.a+i ~ —j—- (33) 

Thus, if we introduce the above results into the definition 
ofPs K : 

= C- 1 ^jL L o a - 1 k- a (34) 

Which can be rewritten in the standard form when de- 
scribing of self-similar objects: 

Ps K (k)^p- a P(k)=P(pk) (35) 

Where p is a constant that, interestingly, depends both 
with the scaling exponent a and the nature of the nesting, 
namely: 

Discussion. — Many interacting systems found in 
nature display a scale-free topology, P(k) oc k~ a , with 
2 < a < 3. In this letter we have shown that the assump- 
tions of the configuration model are enough to explain 
many of the scaling and self-similar properties of the ob- 
served nested subgraphs nets. The resulting prediction 
(35) reveals that, under no correlations, we should expect 
invariance in degree distributions of nested subgraphs to 
occur. This is what we observe in the analysis of real 
nets (see fig. (4)). Indeed, in the analysis of the degree 
frequency we see that, despite the finite size of our sys- 
tem, the degree frequency acts as an invariant, only mod- 
ulated by an scaling factor. These results contrast with 
previous work on sampled subnets obtained from scale- 
free graphs [22]. Although is true that arbitrary subsets of 
nodes might not display invariance, our families of nested 
subgraphs are defined in such a way that our results are 
expected to hold. Further work should address the impact 
of the self-similarity in the functional aspects of the net, 
as well as a broader study of nested subgraphs involving 
different types of real networks. 
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